HUTP-04/A022 



The shape of non-Gaussianities 



Daniel Babich^'^, Paolo Creminelli^ and Matias Zaldarriaga^'^ 

Jefferson Physical Laboratory, 
Harvard University, Cambridge, MA 02138, USA 

^ Harvard- Smithsonian Center for Astrophysics 
Cambridge, MA 02138, USA 



Abstract 

We study the dependence on configuration in momentum space of the primordial 3-point function 
of density perturbations in several different scenarios: standard slow-roll inflation, curvaton and 
variable decay models, ghost inflation, models with higher derivative operators and the DBI model 
of inflation. We define a cosine between the distributions using a measure based on the ability of 
experiments to distinguish between them. We find that models fall into two broad categories with 
fairly orthogonal distributions. Models where non-Gaussianity is created at horizon-crossing during 
inflation and models in which the evolution outside the horizon dominates. In the first case the 
3-point function is largest for equilateral triangles, while in the second the dominant contribution 
to the signal comes from the influence of long wavelength modes on small wavelength ones. We 
show that, because the distributions in these two cases are so different, translating constraints on 
parameters of one model to those of another based on the normalization of the 3-point function for 
equilateral triangles can be very misleading. 



1 Introduction 



Spectacular experimental observations in cosmology caused a certain optimism about our knowledge 
of the very early Universe. The results are sometimes described as a confirmation of the standard 
slow-roll inflation paradigm, but this can be rather misleading. What we really know is that all 
observations are compatible with a scale invariant spectrum of adiabatic perturbations with Gaussian 
statistics and that these perturbations exist outside the horizon at the time of recombination. These 
facts are too generic to be considered a proof of the standard picture and in fact non-minimal 
scenarios or even radically different proposals are still compatible with the data. 

The situation will likely change in the near future. There are three basic observables which we 
consider the most relevant both to confirm or rule out the minimal slow-roll scenario. The experi- 
mental limits on all these parameters are getting close to the interesting range, where a distinction 
among different proposals is possible. 

Tilt of the scalar spectrum. A quite generic prediction of slow-roll inflation is the deviation 
from a completely flat spectrum. This prediction is a consequence of slow-roll itself and it is therefore 
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rather robust. Although the precise number is model-dependent, in most models \n — 1| is of order 
1/Ne, where is the number of e-folds to the end of inflation when relevant scales exit the horizon. 
Present limits are of order |n — 1| < 0.05 {e.g. 0111 El), so that we are entering in the interesting 
region. A deviation from a flat spectrum would strongly support the slow-roll inflation picture and 
it would allow to distinguish it from 'ghost inflation' |^ for example, where |n— 1| is expected to be 
negligible. However, if no tilt is detected slow-roll inflation cannot be safely ruled out: it is easy to 
build models with a tilt as small as we like. 

Gravity wave (GW) contribution. The contribution of GWs is directly related to the value 
of the Hubble constant H during inflation. The detection of a GW signal would therefore point 
towards models with big vacuum energy (V^^^ > 10^^ GeV). Inflationary models fall into two broad 
categories. Models with small vacuum energy (which is equivalent to a very small e, e <C 1/-/Ve; as 
H / [Mp^) is fixed by the spectrum normalization) with totally negligible productions of GWs and 
models with big vacuum energy (usually with e ~ r] ~ l/Ng), where the GW contribution should be 
close to the present experimental limit, r < 0.5 {e.g. 012) . The distinction is quite sharp because 
the two categories can also be distinguished by the variation of the inflaton field during inflation: 
much smaller than the Planck scale in the first case, comparable to the Planck scale in the second. A 
possible criticism against models with a sensible production of GWs is that a variation of the inflaton 
field much bigger than Mp seems out of control of the effective field theory [31 . Extra dimensional 
UV completions provide examples in which this is not true 0. On the other hand models with 
very small e have been considered unnatural as they require a hierarchy between the two slow-roll 
parameters e <C [3. Experiments in the near future will distinguish between the two possibilities. 
A detection of a GW signal would be of great support for the simplest slow-roll inflation scenario. 
GWs are in fact usually negligible in models where additional light fields are responsible for density 
perturbations 00 Ell) (^^^ however rXT) in the ekpyrotic/cyclic scenario jl2j and in ghost inflation 
i- 

Non-Gaussianity. The third observable, which is the main subject of this paper, is the de- 
viations from a pure Gaussian statistics, i.e. the presence of a 3-point function^. There are two 
reasons why the study of the 3-point function is relevant. First of all, in a conventional single field 
model of inflation, the 3-point function can be explicitly calculated as a function of the slow-roll 
parameters |13( I14j. It turns out to be very small: the primordial fluctuations are Gaussian up to 
a level of 10~^ (dimensionless skewness), which is beyond what we can measure in the near future. 
Any deviation from this prediction is therefore a clear sign of departure from the simplest picture. 
The 3-point function therefore appears as the optimal smoking gun for many possible scenarios: 
additional light fields besides the inflaton, imprints of heavy physics through higher dimension op- 
erators, ghost inflation, etc. Conversely, if no significant level of non-Gaussianity is found, this will 
favor the simplest scenario. Also the ekpyrotic and cyclic scenarios, disregarding the open issue of 
matching the perturbations across the bounce, can give an extremely Gaussian spectrum jl5j . 

Another reason why the detection of the 3-point function would be very exciting is that it 
potentially contains a lot of information. The 3-point function of the curvature perturbation in 
momentum space 

depends on 2 parameters which characterize the shape of the (fci, A:2, /^s) triangle, while the de- 
pendence under rescaling of the triangle is fixed by scale invariance^. The purpose of this paper 

"'^The presence of any connected n-point function {n > 2) indicates a deviation from a perfectly Gaussian 
signal. We concentrate on the 3-point function as it is much bigger than the others in all the model we 
consider. 

^Given the limits we have on the scalar tilt, scale invariance is a very good approximation. 
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is to show that this function of two parameters contains a lot of information about the source of 
non-Gaussianity and that it could be useful to distinguish among different models. Moreover we 
will study how the experimental limits, which are given assuming a particular form of the 3-point 
function, change if we modify the shape dependence of 

There are in principle other observables which could turn out to be relevant, like for example 
an isocurvature contribution in the perturbations. Unless a conserved quantity such baryon number 
prevents it, thermal equilibrium is capable of erasing any isocurvature fluctuations imprinted early 
on, so that it is rather difficult to get generic predictions. Therefore only for the three observables 
described above we are confident to enter, with the experimental progress in the next few years, in 
an interesting range. Whatever the results turn out to be we will get further insight into the early 
cosmology. 

In section 121 we will describe the general features of the 3-point function in different models and 
underline the qualitative differences. In section |31 we plot the different functions and we quantify how 
"orthogonal" two distributions are. In section^ we study the effect of projecting the 3d space into 
a 2d Cosmic Microwave Background (CMB) map and how experimental limits on non-Gaussianity 
change taking a different shape dependence. We leave to the appendix a discussion about the 
approximation we used in the 2d analysis. Conclusions are drawn in section El 



2 Shape dependence in different models 

Translational invariance forces the 3-point function to conserve momentum 

iCkSk^CkJ = (2^)'^( E h)Hki,k2,h) , (2) 

i 

while scaling invariance implies that the function F, symmetric in its arguments, is a homogeneous 
function of degree —6 

F{Xh, Xk2, Xh) = X-'^F{h,k2, h) . (3) 

Rotational invariance further reduces the number of independent variables to just 2, for example 
the two ratios k2/ki and k^/ki. Note that the function F is real, because the 3-point function in 
position space cannot change if we change sign to all coordinates. 

One interesting form for the function F is the one usually assumed for the analysis of the data 
(see e.g. ^El)- The quantity we observe C is not Gaussian but it contains a non-linear "correction" 

C(x) = C,(x) - IhdCgixf - {Q) , (4) 

where Cgi^) is Gaussian. Experimental limits are usually put on the scalar variable /nl O- The 
most stringent limit comes from the WMAP experiment ^H] 

-58 < /nl < 134 at 95% C.L. (5) 

If we go to Fourier space, eq. (jlj) implies a function F of the form 

i^local(^l,^2,4) = 2(27r)4(-^/NLP|) • ^ , (6) 

•^The 3/5 is introduced so that /nl parametrizes the aniplitude of the non-Gaussian departures of matter- 
era gravitational potential on the large scales. 
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where Pti is the amplitude of the power spectrum. Currently the best constraint on its amplitude 
comes from the CMB anisotropy measurement by the WMAP satellite, P^"^ ~ 4.3 x 10~^ pQ. 

Although originally taken as a simple ansatz, this shape dependence turns out to be physically 
relevant for many models which predict a sensible non-Gaussianity. The reason is that eq. 
describes (at leading order) the most generic form of non-Gaussianity which is local in real space. 
This form is therefore expected for models where non-linearities develop outside the horizon. This 
happens for all the models in which the fluctuations of an additional light field, different from the 
inflaton, contribute to the curvature perturbations we observe. In this case non-linearities come from 
the evolution of this field outside the horizon and from the conversion mechanism which transforms 
the fluctuations of this field into density perturbations. Both these sources of non-linearity give a 
non-Gaussianity of the form Q because they occur outside the horizon. Examples of this general 
scenario are the curvaton models ^ , models with fluctuations in the reheating efficiency [HI Ell 
multi-field inflationary models (^). 

Being local in position space, eq. ® describes correlation among Fourier modes of very different 
k. It is instructive to take the limit in which one of the modes becomes of very long wavelength 
|13j . ^3 — > 0, which implies, due to momentum conservation, that the other two /c's become equal 
and opposite. The long wavelength mode freezes out much before the others and behaves as 
a background for their evolution. In this limit -Fiocai is proportional to the power spectrum of the 
short and long wavelength modes 

-^^locai 0^ pp • (7) 

This means that the short wavelength 2-point function (Cg^C.^^) depends linearly on the background 
wave 

^ d 

^^k3 

From this point of view we expect that any distribution will reduce to the local shape © in the 
degenerate limit we considered^, if the derivative with respect to the background wave does not 
vanish. 

In standard single field slow-roll inflation the limit /ca — > is quite easy to predict. As pointed out 
by Maldacena jl3j . different points along the background wave are equivalent to shift in time along 
the inflaton trajectory, so that the derivative with respect to the background wave is proportional 
to the tilt of the scalar spectrum. This can be explicitly checked in the full expression of the 3-point 
function jl3j 



Fst.ndikl,k2,k3) = li2Tr)^Pl ^ 



(9) 



where e and rj are the usual slow-roll parameters and kt ^ ki + k2 + k^. In the limit ^3 — > eq. @ 
goes as 

i^stand(^3 ^ 0) oc 2(r? - 3e)pp = (n, - l)-^ -3 . (10) 

1 2 1 2 



"^In these niodels additional contributions to F not of the local form ^ can be present; they describe non- 
Gaussianities generated at horizon crossing. Nevertheless the local contribution is dominant because it has 
time to develop outside the horizon for many Hubble times before the final conversion to density perturbations 

nHi. 

^The derivative with respect to the background cannot depend on the relative orientation of fci and fca, 
because this would need a derivative acting on the background giving a subleading contribution in the limit 
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As expected the tilt in the spectrum Ug fixes the degenerate limit of the 3-point function. Note 
however that expression Q is not of the local form © but contains contributions which are impor- 
tant for non-degenerate triangles. If we compare expression @ and © and neglect the different 
shape dependence, we see that standard single-field inflation predicts /nl of order of the slow-roll 
parameters. 

We have seen that the degenerate limit ^ describes the effect of a slowly- varying background 
wave on the 2-point function. In many models the correlation is much weaker in this limit than in 
the local case Physically this means that the correlation is among modes with comparable 
wavelength which go out of the horizon nearly at the same time. In this case the 3-point function 
in the degenerate limit is suppressed by powers of ^3 with respect to the behaviour of eq. 0. We 
have correlation among modes of comparable wavelength in all models in which the non-Gaussianity 
is generated by derivative interactions: these interactions become exponentially irrelevant when the 
modes go out of the horizon because both time and spatial derivatives become small, so that all the 
correlation is among modes freezing almost at the same time. 

One example of this kind of models is obtained if we add higher derivative operators in the usual 
inflation scenario; the leading operator of this form is 
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where (f) is the inflaton. It is straightforward to calculate the 3-point function after the addition of 
this operator ^Hl- The result is 
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FKdXki,h,k3) =-(27r)^P^^. 



kf + fej 3kfk'j)+ (kfkjk, 



I - Akmki) 



(12) 



The ratio where is the velocity of the inflaton, is expected to be less than one in the 

regime in which we can trust an effective field theory description with cut-off A and neglect the 
infinite set of higher dimension operators. Therefore the effect cannot be too big: comparing the 
previous expression with eq. © and neglecting the shape dependence we expect roughly /nl ^ 1, 
unless we want to enter into the regime where higher order corrections are unsuppressed. It is easy 
to check that the expression in brackets in eq. H12|) vanishes as k^ in the limit ^3 ^ ^Uj. The 
correlation is therefore highly suppressed in the degenerate limit with respect to the local shape ©: 
the additional powers of k^ come from the derivatives in the operator Hll() acting on the background 
wave. The correlation is among modes of comparable wavelength, because the higher derivative 
interaction vanishes exponentially outside the horizon. 

A model of inflation based on the Dirac-Born-Infeld (DBI) action has recently been proposed 
j2()| I21j. This model predicts significant non-Gaussianities and gravity waves. The predicted form of 
the 3-point function is the same as in equation (fT^ . but the level of non-Gaussianity is much bigger 
because higher derivatives terms are crucial for the inflaton dynamics. 

As a final example we consider the 3-point function predicted by 'ghost inflation'. Without 
entering into the details of the model we stress that also in this case the 3-point function is 
generated by a derivative interaction, so that we expect the same qualitative behaviour than in 
the previous example, with a substantial level of non-Gaussianity. The explicit form of the 3-point 
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function is not illuminating 

Fgho.t(fci,fe2,4) =2^2 vr^Vio r(i)'/' /3a-8/5p^/'. (13) 
• Re dii ri-'F*{ri)F* (^^t?^ F'* (^^t]^ ^3(^1 • ^2) + symm. 

where the contour of integration is oc (— 1 — i), a and (3 are unknown order one coefficients and 

It can be checked that in the limit /ca — > the integral goes like A;|. As in the previous example the 
correlation is therefore suppressed for modes with very different wavelength. 

As the explicit expressions of the 3-point functions we showed have progressively become more 
and more complicated, in the next section we show the explicit plots of the functions, so that their 
behaviours and differences can be better appreciated. 

3 A 3D comparison 

Imagine that we measure the density perturbation in a 3-dimensional survey. We assume that 
the 3-point function is of the form 

(Cfc,C,-,C,-3) =^-(2vr)3^(E^0^(^i'^2,fe3) (15) 

i 

and we want to use the data to measure the overall amplitude A (^). It is easy to check that the 
best estimator for A, in the limit of small non-Gaussianity, is 

Eg,i^(fel,^^2,fe3)V««) ' 

where (t| is the variance of a given mode and the sums run over all triangles in momentum space. 
This is the estimator with the least variance. Expression (|16|) naturally defines a scalar product 
between two distributions Fi and F2 

F,.F2 ^ ^Fi(^i,fc2, ^3)^2(^1, fc2,^3)/(«CT23) . (17) 

hi 

Its intuitive meaning is clear: if two distributions have a small scalar product, the optimal estimator 
H16|) for one distribution will be very bad in detecting non-Gaussianities with the other shape and 
vice versa. We will be more quantitative below. But first of all we want to use this scalar product 
to make meaningful plots of the different shapes we described in the previous section. 

As we discussed the function F depends on only two independent variables. We choose them 
to be X3 = k^/ki and X2 = k2/ki and we further assume X3 < X2 to avoid considering the same 
configuration twice. The inequality X3 > 1 — X2 follows from the triangular inequality. Looking at 
eq. (|17j) we see that in the definition of the scalar product there is a factor x^Xg coming from the two 



^We should choose some standard normalization for F to give sense to the overall amplitude A. We will 
come back to this point later. 
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spectra in the denominator which are approximately scale-invariant. Furthermore a measure 

is required to go from the 3D sum over modes to the integral over X2 and 2:3. We conclude that the 

most meaningful quantity to plot is 

F{l,X2,X3)xlxl, (18) 

so that the integral of the product of two functions we plot gives directly the scalar product. 

In figure ^ [2 01 and \^ we show the shape dependences discussed in the previous section. To 
avoid showing equivalent configurations twice, the function is set to zero outside the triangular region 
1 — X2 < X3 < X2- In the first two figures we see, as expected, that the "signal" is concentrated on 
degenerate triangles X3 ~ 0, 0:2 — 1, while in the same configuration the third and fourth plots are 
suppressed. In these two cases the correlation is bigger among modes of comparable wavelength, i.e. 
equilateral configurations X2 — 3:3 ~ 1. 

We want to be quantitative about the shape difference of the distributions. From the scalar 
product (|17() we can easily define the cosine between two distributions 

F F 

which will be a number between —1 and 1 (^), which tells us how orthogonal two shapes are. If the 
cosine deviates sensibly from 1, the distinction between two shapes is easy, assuming that a 3-point 
function has been detected. We numerically calculated the cosine between the 3-point functions we 
discussed with respect to the local distribution, usually assumed in the data analysis. The results 
are given in tabled We see, as expected, that the distributions given by higher derivative terms and 
ghost inflation are not "collinear" with the local distribution: the cosine deviates significantly from 
1. The distribution predicted by the conventional slow-roll scenario is on the other hand quite close 
to the local distribution, unless n — 1 = 2(ry — 3e) = in which case the spectrum is scale invariant 
and the 3-point function looks quite similar to models with derivative interactions. In going from 
positive to negative tilt the 3-point function changes sign and close to the transition the cosine with 
the local model is close to zero. 

There is another interesting quantity we can calculate to compare different distributions. For 
the local distribution we can take A = /nl in eq. (|15|) and normalize all the other distributions 
at the equilateral configuration. For every distribution we will have an overall amplitude /^l^'' ) 
which can be directly compared to the local case for an equilateral configuration. Imagine now that 
a 3-dimensional set of data is used to get a limit on /nl for the local distribution. How can we 
translate this into a limit for /^x^'' for another distribution? We can define a "fudge factor" which 
converts limit from /nl to f^^^' for another shape dependence. From eqs. (fT6|) and (|T7|) we easily 
obtain that the fudge factor / is 

f{F) ^ / • . (20) 

-f^ local ■ local 

The limit on /^l"''' of a given distribution will be the usual /nl parameter divided by f{F). Obviously 
this procedure is not optimal, because we are using an estimator ((TB)) appropriate for the local 
distribution to set limits on a different angular dependence. Anyway it is an easy and fast way to 
get approximate limits without doing the full analysis with a new shape dependence. The fudge 
factors for the different distributions are given in table ^ We see that the fudge factor is much 
smaller than 1 for the distribution generated by higher derivative terms and for ghost inflation. 



''Obviously the sign is irrelevant as it can be switched by changing the sign of one of the F^s. 
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Figure 1: Plot of the function F{1, X2, xs) X2X3 for the local distribution ©. The figure is 
normahzed to have value 1 for equilateral configurations X2 = X3 = 1 and set to zero outside the 
region 1 — X2 < X3 < X2- 




Figure 2: Plot of the function F(1,X2,X3) for the usual slow-roll inflation ® with e = r] = 
1/30. The figure is normalized to have value 1 for equilateral configurations X2 = x^ = 1 and set to 
zero outside the region 1 — X2 < x^ < X2- 



It is interesting to rewrite the definition of f{F) as 



F ■ -^local . „ „ . I F ■ F 



f{F) = / ^ = cos(F, Fiocai) ( ^ ^ ) ■ (21) 

^ local ^ local q \ local local / 



1/2 



Figure 3: Plot of the function F{1,X2, x^) x'^x'^ for non-Gaussianities generated by higher derivative 
interactions H12|) and in the DBI model of inflation j2Ul I21j . The figure is normalized to have value 
1 for equilateral configurations X2 = X3 = 1 and set to zero outside the region 1 — X2 < X3 < X2- 



Ghost inflation 1 




Figure 4: Plot of the function F{l,X2,xs) x^x"^ for ghost inflation H13|). The figure is normalized 
to have value 1 for equilateral configurations X2 = X3 = 1 and set to zero outside the region 

1 - X2 < X^ < X2. 



We see that the fudge factor is proportional to the cosine between the distributions. This suppression 
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Distribution 


3d Cosine 


3d Factor 


2d Cosine 


2d Factor 


Ghost 


0.33 


0.06 


0.52 


0.16 


Higher Deriv. 


0.45 


0.10 


0.64 


0.24 


e = 1/30, 7] = 1/30 


0.99 


0.73 


0.99 


0.76 


e = 1/300, 7] = 1/30 


1.00 


1.12 


1.00 


1.11 



Table 1: The 3d and 2d Cosines and Fudge Factors for several different primordial distributions. 
The values for the DBI model coincide with those of the higher derivative distribution. In the limit 
e — > the standard inflation distribution reduces to the local distribution: cosines and fudge factors 
goes to 1. The 2D numbers are obtained using instrumental noise and band width appropriate for 
WMAP. 

can be eliminated using an optimal estimator for the distribution of interest. The other factor is the 
ratio between the two norms (i.e. the overall signal) once the functions are normalized at the same 
value for equilateral triangles; this cannot be changed by the analysis. Obviously for the distributions 
of fig. © and (HJ) this ratio is quite suppressed as evident from the plots. That explains the smallness 
of the fudge factors for these two distributions. For example for the ghost model the suppression is 
a factor of 16, where approximately a factor of 3 comes from the cosine and a factor of 5.5 from the 
ratio of norms. 

Finally we want to mention the fact that non-linear evolution of modes inside the horizon also 
creates non-Gaussianity in the density field. It can be observed for example in the local distribution 
of galaxies (see |221 for a review). These non-Gaussianities are not scale invariant, a fact that can 
be used to separate them from the primordial contributions [22] . As an example in figure 13 we 
show the shape of the three point function of the density for k = 0.1 h Mpc~^ for a qualitative 
comparison with our previous figures of primordial non-Gaussianities. These non-Gaussianities are 
not only scale dependent, but they also have a different dependence on the triangle shape. The 
density 3-point function peaks for collinear configurations, that is when the three wavevectors are 
parallel. This happens because gravity generates density and velocity-divergence gradients that are 
parallel to the velocity flows 

4 A 2D comparison 

Non-Gaussianity in the CMB. The initial curvature perturbations produced during inflation 
cause corresponding fluctuations in the matter species in the universe. These fluctuations, after 
being modifled by gravitational and hydrodynamical evolution, produce anisotropics in the CMB. 
Therefore non-Gaussian statistics of the CMB can be used to constrain the non-Gaussian statistics of 
the underlying perturbations. Most of the fluctuations that we observe in the CMB were imprinted 
at the epoch of last scattering, around a redshift of z ~ 1100. Since the fluctuations were very small 
at that time it is possible to calculate the radiative transfer of the CMB using linear theory. In 
this regime any non-Gaussianity in the CMB will be directly related to primordial non-Gaussianity. 
In reality there are non-linear corrections to the gravitational and hydrodynamical evolution which 
will produce non-Gaussianities even if the primordial perturbations are Gaussian. One expects such 
corrections, being of second order in the perturbation, to produce an equivalent /nl ~ 1- This 
is below the current experimental limit but non-linear effects might become important for future 
experiments. We will ignore them in what follows. 

The correspondence between the non-Gaussianities in the CMB sky and those of the primordial 
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Figure 5: Shape dependence of the density 3-point function for k = 0.1 h Mpc^ . The signal is 
largest for collinear triangles, when the three wavevectors are parallel. These fluctuations are not 
scale invariant so both the amplitude and details of the shape are functions of the scale. We present 
results for a particular wavelength for illustrative purposes. 



curvature is not direct because of the gravitational and hydrodynamical evolution before the epoch 
of last scattering. Moreover with the CMB one maps a 2-d surface of the universe. The temperature 
on this surface is a projection of the 3-d curvature perturbations in the surface's vicinity. This fact 
introduces further complications if we are interested in the k dependence of the primordial 3-point 
function, not just in its amplitude. In a CMB map one can measure only the components of k 
that are parallel to the plane of the sky, but not the perpendicular one. As a result a measurement 
of the 3-point function of the CMB temperature for a triangle of one particular shape, will receive 
contributions from 3-d triangles with a variety of shapes. Moreover the perturbations in the CMB are 
no longer scale invariant. The evolution inside the horizon imprints several scales in the spectrum, 
like the scale of the sound horizon or the scale of photon diffusion. These departures from scale 
invariance, though mild, complicate the analysis. 

CMB statistics. As for the curvature fluctuations in 3-d, we will study the 3-point function 
of the CMB temperature in Fourier space. We will follow the notation of 125]. The temperature 
fluctuations on the sky are expanded in spherical harmonics. 



We consider the 3-point function (aiimifli2m2^«3m3) and, assuming rotational invariance, construct 
the angular averaged bispectrum 




(22) 



Im 




(23) 
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As we did for the curvature perturbations, we can now define a scalar product in 2-d 

Bi-B2 = ^ Bi{li,l2,h)B2{h,l2,h)/{fh,h,h^hChCh) ^ (24) 

where fi^,i2,h ^ combinatorial factor equal to 1 if the three Ts are different, to 2 if two of them are 
equal and to 6 if all of them are equal. The noise in the denominator of ()24() has been calculated 
in the Gaussian limit and includes instrument noise and beam width in the standard way [251 126j . 
The dot product can be used to define a 2-d cosine, 

C0s(5i,i?2) = -, TTTTT ! (25) 

^ ' ' {Bi- BiY/^{B2- B2Y/^ 

and a 2-d fudge factor, 

-Dlocal • -Dlocal 

Calculation of CMB Bispectra. The temperature anisotropies on the sky are linearly related 
to the underlying curvature perturbations. The contribution to the temperature fluctuations at 
multipole / from a curvature fluctuation with wavenumber k is encoded in the radiation transfer 
function Af{k). In particular we have, 

(^(^(^^:™.(^i)>i:™.(^2)i'/:™3(^3) (27) 



:(2^)35(3)(^^^)i.(fc^,A;2,A;3)A^(A:i)A;;;(A;2)A^(A:3) • 



The radiation transfer function Af{k) can be calculated with publicly available software such as 
CMBFAST 29 . Expressing the 5 function as an exponential and expanding it in spherical harmonics 
and Bessel functions we get, 

{ai,niiai2m2ahm3) = J 2^iJ^^^ '^hdh 2k^dks J ^2^y*^^{x)YiI^^{x)YiI^^{x) (28) 

oo 

X j x^dx ji^{kix)ji2{k2x)ji,^{k3x)F{ki,k2,k3)Al{ki)Al{k2)Al{k3) . 



In three dimensions, translation invariance forced the three /c's in the 3-point function to add to 
zero. In 2-d the equivalent constraint of rotational invariance in enforced by the Gaunt integral 



(2/i + l)(2/2 + l)(2/3 + 1) f h h h\( h h I 



t3 



47r \ y \ mi m2 

It forces li, I2 and /a to satisfy a triangle inequality. The integral 



(29) 



00 



determines the strength with which a 3d triangle contributes to a 2d triangle. When considering 
the 2-point function, the equivalent of eq. (|29|) is Si^^i26mi,m2 ^-i^d sq. (|30|) becomes proportional to 
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6^^\ki — k2)- Using these definitions we can write the reduced CMB bispectrum in a more convenient 
form 



mul2,k) =y ^ ^0 qJ (31) 

J TT TT TT 

Numerical Challenges. The evaluation of equation (|.Slj) is numerically very challenging. It 
not only involves a four dimensional integral, but both the Bessel functions in eq. (|3U|) and the 
radiation transfer functions in eq. H31|) are very rapidly oscillating. For example the radiation 
transfers function oscillate when k changes by order the inverse of the distance to the last scattering 
surface, Ak ~ 1/c?lss- Moreover the contribution to multipole / comes preferentially from modes 
with wavenumber k ~ l/di^ss- Thus for / ~ 1000 the radiation transfer functions have many 
oscillations in the k range of interest. We can estimate that in order to even crudely calculate the 
integral in equation (|31j) the transfer function needs to be evaluated in 200 values of k. Since there 
are three integrals over k the required number of evaluations is roug hly (200)3 ~ 10"^. In addition 
there is the integral over the three spherical Bessel function, eq. (|30|) . which is extremely difficult 
to evaluate numerically due to the oscillatory nature of its integrand. Let us assume that it can be 
done with 10^ operations. We need to calculate these integrals roughly l"^^^ times because of the 
sum which appears in the dot product (|25|) . For Imax ~ 1000 this is 10^. We must perform roughly 
10^^ operations to compute the dot products by brute force. 

It is clear that an alternative way to evaluate these integrals must be developed. For the particular 
case of the local distribution, i*iocai in eq. © can be expressed as a product of functions of ki, /c2 and 
^3 separately and the integral in equation (|31|) can be split and done more easily |25j . We cannot 
do this in general because some of the distributions we are considering depend of kt and cannot be 
f actor ized. 

In the appendix we present a technique to evaluate equation (|^T|) using the flat sky approxima- 
tion. Under this assumption a simple change of variables can eliminate most of the oscillations in 
the transfer functions and the integral can then be evaluated numerically with little effort. 

Results. The results of cosines and fudge factors with respect to the local distribution are 
listed in Table ^ We used the instrumental noise and beam width appropriate for the WMAP 
experiment even though we do not expect big differences for other experiments. The first two 
entries are for ghost inflation and for the 3-point function generated by higher derivative operators 
(which coincides with the one in the DBI model). We also show the results for the standard slow-roll 
inflation distribution calculated for different values of the slow-roll parameters. 

Tabled shows that the cosines between distributions calculated for an ideal 3-D experiment and 
those calculated for a CMB map are quite similar. That is to say, models that are distinguishable in 
3-D are also distinguishable in a 2-D survey. For a given 2-D Fourier mode, the components parallel 
to the plane of the sky of the 3-D modes that contribute to it are fixed, A?" = l/d^ss- However 
Fourier modes with all possible values of the wavevector component perpendicular to the plane of 
the sky k± can contribute. As a result one expects that the configurational dependence of the 3-point 
function in 2-D be somewhat washed out relative to the 3-D case. Triangles with all different shapes 
in 3-D contribute to a given triangle in 2-D. This effect however is rather mild as evident in Table 
n The fact that the primordial bispectrum is scale invariant, i.e. its amplitude is proportional to 
k~^, implies that the dominant contribution comes from modes with k± ~ 0; the information on 
the shape is conserved. In a sense this is the same reason why we see acoustic peaks in the power 
spectrum. In that case one could also argue that modes with different wavenumbers k contribute to 
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any given I and thus the acoustic oscillations in the 3-D transfer function would be washed out in 
the temperature power spectrum. Clearly that only happens to a small degree. 

From the table we can infer how the constraints on /nl from the WMAP 1-yr data convert to 
limits on /^*^''' for different distributions. For example the allowed interval 

-58 < /nl < 134 at 95% C.L. (32) 

is approximately degraded to 

-360 < /X"' < 840 at 95% C.L. (33) 

for the ghost model. The suppression is roughly a factor of 6. A factor of 3 comes from the difference 
in norms {i.e. the overall signal once the functions are normalized at the equilateral configuration) 
and a factor of 2 from the cosine. This last piece could be eliminated by optimizing the data analysis. 



5 Conclusions 

Deviations from Gaussianity could become a very important probe of the early universe physics, 
responsible for the density inhomogeneities we observe in the universe today. The minimal slow- 
roll model of inflation predicts negligible non-Gaussianities (at the level of 10~^) but many of the 
alternatives predict levels substantially larger. 

The discovery of a 3-point function could provide substantial additional information on the 
mechanisms that generated the non-Gaussianities through its dependence on triangle shape. In 
this paper we studied that dependence for some of the best motivated alternatives and quantified 
the observability of the differences in shapes. We concluded that there are broadly two classes of 
shapes for the 3-point function. In models such as ghost inflation, the DBI model or when there 
is significant imprint from heavy physics through higher derivative operators, the non-Gaussianity 
peaks for "equilateral-type" configurations. For models where the non-Gaussianities are produced 
outside the horizon the shape of the 3-point function peaks in the collapsed triangle limit, when 
the wavelength of one of the modes is much larger than that of the other two. This limit is well 
described by the local model. 

We quantified these differences by introducing a cosine between the distributions with a measure 
based on the signal to noise, that is the ability of experiments to distinguish the different shapes. 
We found that the cosines between distributions are very similar for CMB experiments and ideal 3D 
experiments, where the full gravitational potential is mapped in three dimensions. We found that 
the cosine between the local model and the "equilateral-type" models is around 0.3-0.4 in 3D and 
0.5-0.6 in 2D. That is to say, the two distributions are quite orthogonal. 

The low cosine means that data analysis techniques optimized for one distribution are not optimal 
for the other. Setting constraints on the local model is computationally much simpler than for the 
other examples as a result of various tricks developed in the literature j2Hl- Our results suggest that 
to fully exploit available and future data one should find ways of extending existing techniques to 
apply for other 3-point function shapes. 

When comparing different models of non-Gaussianity it has become fairly standard to normalize 
them to have equal amplitude at equilateral configurations. We showed that because the local model 
has a fairly small signal there while the "equilateral-type" models peak for these configurations, using 
this method to read off limits for one model based on constraints on another can be quite misleading. 
For example we found that the constraints on the ghost-inflation model are significantly relaxed. This 
is mainly because when normalized at the equilateral configurations, the ghost model is significantly 
less non-Gaussian than the local model, and to a lesser extent because the data analysis is not 
optimized for the ghost-inflation distribution. 
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Appendix 

In this appendix we present the approximated method we used to calculate the 2d cosines and fudge 
factors. We will start with the integral solution of the brightness equation |29j . 



AT r d^h ^ /■'^o 

-jT^n) = j dTe^'-^^^^'~^^S{k,T) , (34) 

where AT(h)/T is the fluctuation in the CMB temperature in the h direction, S{k,T) is the CMB 
source function. The source function encodes the effects of the metric perturbations and photon 
fluctuations, through the integrated Sachs-Wolfe effect, Doppler effect, gravitational redshift, etc., 
on the observed CMB. 

In the flat sky approximation one considers directions very close to some fiducial direction, and 
ignores the curvature of the sky taking n to lie in the plane perpendicular to the fiducial direction. 
This is equivalent to approximating the sphere in a neighborhood of a point by the tangent plane 
at that point. In this limit the equivalent of spherical harmonic transformation becomes simply a 
Fourier transform. We have 

a{l) = I d^ne-^'-'^^in). (35) 

We can separate the exponential term in the line of sight integral into two pieces that depend of 
the wavevectors parallel and perpendicular to the tangent plane, 

a{l)= j -^^({k) rdTS{k,T) [ ci2^e-*'-^e^^"-^("o-")e^'='("o-") . (36) 



(2vr) 







Evaluating the integral over n we recover a 2D 5 function that requires I to be equal to the projected 
wavevector kW times the distance to the last scattering of the observed photon, 

■^Cik) dTS{Kr)e'^''^^^-^\2^)H\l- k^r^ - r)) . (37) 

This approximation will break down when the tangent plane needed to define a mode with wavenum- 
ber I has large deviations from the surface of the sphere that defines the last scattering surface (LSS), 
that is to say when considering large angular scales. Note that we have not assumed that recom- 
bination happens instantaneously. Although the source function is strongly peaked around the 
decoupling time tr, it has a width 6tr which for our purposes cannot be ignored. 

We separate the phase e*'^^^'^""'^^^ in eq. (|37|) . this factor will cancel when we look at n-point 
correlation functions of a{l) because of momentum conservation. We have 

^^ik^ro-rn) J ^2j^\\^^j^^ J dTSik,T)e'''^^^^~^^5\l-k^To-T)) . (38) 
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This allows us to define a radiation transfer function as, 

Jo 

Comparing eq. to the all sky formula [2^1 

Afik) = HdrSik, t)3i [fc(ro - r)] , (40) 



we can see that spherical Bessel function encapsulates both the 3D to 2D map of the 6 function and 
the oscillation present in the exponential factor e*^^('^"~'^^\ 

Now we can calculate the bispectrum by taking the ensemble average of a product of three a(/), 

^ ^ ^ I' r d^ki d^^ko d^k-} ^ 

{a{h)a{l2)a{k)) = J dndr^dr, J --i_^_l(C(A;i)C(A;2)C(A:3))S(fci, n)5(A:2, r2)5(fc3, rg) 

^ei{rH-r^)kf^^{r^-r,)ki^i{ra-r,)k|^2^l'^ _ _ ^^)^s\l2 " ^ (^0 " T2))6\h " fc| (tq " Tg)) . (41) 

Now we use (C(^i)C(^2)C(^3)) = (2vr)^5^(fci23)i*'(A:i, ^21 ^3) and assume that 6k^^ ■ V kF{ki,k2,k^) is 
small, where J/c" is the variation in fc" for a given / as the tangent plane sweeps across the width of 
the last scattering surface. It is clear from geometry that J/cl'/A;" will be order Stu/tr ~ 10~^. The 
3-point functions we are considering do not have sharp features so this assumption will allow us to 
use an average /c" in our evaluation of the primordial bispectrum without introducing a large error. 
This is equivalent to interchanging the line of sight integral and the integral over Fourier space and 
evaluating k^^ at //(tq — r^j), 

{a{h)a{l2)a{h)) = {tq - mfd^^^U^) j dktdkldkl5{kl^^)Fik[,k'^,k!,)A^ilukl)A'^il2,kl)A''ih,k^ 

(42) 

where k' means k evaluated such that A;" = //(tq — r/j) and 

A^il,k^)= . ^ . , SiVik^y + lyiro - t)\ ry^'^^n-r) (43) 
Jo {to - ry 

Using the definition for the bispectrum, {a{li)a{l2)a{ls)) = {27r)^S^'^\li23)B{li,l2,h) we get 
B{h,l2,h) = ^""""^^f^' I dA:^dA:|dfc|5«(A:f23)i^(A:;,A:^,fc^)A^(/i,A:J)A^(/2,A:|)A^(/3,fc|) . (44) 

This formula is completely generic, it can be used to calculate the flat sky bispectrum produced 
by any primordial perturbation field. It is much easier to handle numerically and was the basis of 
our evaluation of the dot products presented in Table ^ 

In the flat sky limit we have a 2D 6 function which forces the 2D modes to form a closed triangle. 
In the full sky calculations this is enforced by the Wigner 3j symbols of eq. ()29|). 
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